Add LLM Assistant Functionality for Scoring #39
base: main
Conversation
llmModel is now used for more granular control over which model to use.
src/main/kotlin/ch/uzh/ifi/access/service/CourseConfigImporter.kt
```kotlin
// Polling loop
var attempts = 0
val maxAttempts = 20 // Adjust as needed
```
Can you recommend a reasonable range of values from your experience?
A call to the service can take up to 2-3 seconds for longer messages. I think 20 should be enough for most cases, but in the case of an exam with more students a higher number would be better (maybe 30 or 40). The service then polls for around one minute before giving up, which should only be an issue if more than 30 students submit at exactly the same time and the model is under high load, so it responds more slowly. From my experience, GPT-4o is a bit slower than smaller models like 4o-mini.
In the end this is just an upper limit; it makes no difference if we set it to 100, the service will simply try for longer when checking the status of the evaluation task.
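The trade-off described above (attempt limit vs. how long a slow model gets before the service gives up) can be sketched as a simple bounded polling loop. This is an illustration, not the PR's actual implementation: `pollUntilDone`, the status supplier, and the sleep interval are hypothetical stand-ins.

```java
import java.util.function.Supplier;

public class PollingSketch {
    // Polls a status supplier until it reports "done" or maxAttempts is reached.
    // Returns the number of attempts used, or -1 if it gave up.
    static int pollUntilDone(Supplier<String> fetchStatus, int maxAttempts, long sleepMillis) {
        int attempts = 0;
        while (attempts < maxAttempts) {
            attempts++;
            if ("done".equals(fetchStatus.get())) {
                return attempts;
            }
            try {
                Thread.sleep(sleepMillis); // ~2-3 s per call in practice; tiny here for the sketch
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                return -1;
            }
        }
        return -1; // upper limit reached; the caller decides how to handle the timeout
    }

    public static void main(String[] args) {
        // Simulated evaluation task that finishes on the 5th status check.
        final int[] calls = {0};
        Supplier<String> fakeStatus = () -> (++calls[0] >= 5) ? "done" : "pending";
        System.out.println(pollUntilDone(fakeStatus, 20, 1)); // finishes within the limit
        calls[0] = 0;
        System.out.println(pollUntilDone(fakeStatus, 3, 1));  // gives up: limit too low
    }
}
```

Because the loop exits as soon as the status flips to "done", raising `maxAttempts` only extends the worst case, which matches the comment that setting it to 100 costs nothing in the common case.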
```kotlin
val assistantResponse = evaluateSubmissionWithAssistant(
    AssistantDTO(
        question = task.llmPrompt ?: task.instructions ?: "No instructions provided",
        answer = studentCode,
```
This I don't understand. It appears that the LLM service is provided with submission.files, which will include both the code and the text answer file? I thought it should contain only the text answer (since that is what the rubrics etc. are targeting). Or is it better to include the student's code? Wouldn't it confuse the model if the code is wrong? Also, I think the ACCESS frontend will need to send the text answer separately (i.e. not as part of submission.files, but for example in a newly added submission.textAnswer, which we would need to add to the Submission model, the DTO, and the frontend). In other words, the frontend needs to:
- check whether the task involves a text answer
- remove the text answer from the files submitted as "regular submission files"
- add the text answer as an extra LLM text answer field to the submission
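The three frontend steps listed above could be sketched roughly as follows. The `extractTextAnswer` helper, the file names, and the idea of keying files by path are all hypothetical; the actual ACCESS Submission model and DTO may look quite different.

```java
import java.util.HashMap;
import java.util.Map;

public class SubmissionSplitSketch {
    // Hypothetical split: pull the text answer out of the submitted files
    // and carry it in a dedicated field, leaving only "regular" files behind.
    static String extractTextAnswer(Map<String, String> files, String textAnswerPath) {
        // Step 1: check whether the task involves a text answer at all
        if (!files.containsKey(textAnswerPath)) return null;
        // Steps 2 + 3: remove it from the regular files and return it separately
        return files.remove(textAnswerPath);
    }

    public static void main(String[] args) {
        Map<String, String> files = new HashMap<>();
        files.put("solution.py", "def f(): ...");
        files.put("answer.md", "The algorithm is O(n log n) because ...");

        String textAnswer = extractTextAnswer(files, "answer.md");
        System.out.println(textAnswer);     // would go into e.g. submission.textAnswer
        System.out.println(files.keySet()); // only regular submission files remain
    }
}
```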
Yes, this was a mistake. I have now changed it to take only the file specified by file path in the config. This way we can always pick the relevant file and send only that.
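The "take only the configured file" behaviour described in this reply could look something like the sketch below; the `TaskFile` shape and the config path parameter are illustrative, not the actual service code.

```java
import java.util.List;
import java.util.Optional;

public class LlmFileSelectionSketch {
    record TaskFile(String path, String content) {}

    // Hypothetical lookup: of all submission files, forward only the one
    // whose path matches the file path given in the task config.
    static Optional<String> selectLlmFile(List<TaskFile> files, String configuredPath) {
        return files.stream()
                .filter(f -> f.path().equals(configuredPath))
                .map(TaskFile::content)
                .findFirst();
    }

    public static void main(String[] args) {
        List<TaskFile> files = List.of(
                new TaskFile("main.py", "print('hi')"),
                new TaskFile("answer.md", "Because the loop runs n times ..."));
        System.out.println(selectLlmFile(files, "answer.md").orElse("<no answer file>"));
        System.out.println(selectLlmFile(files, "missing.md").orElse("<no answer file>"));
    }
}
```

Returning an `Optional` makes the "file not found in the submission" case explicit, which matters if the config path and the submitted file names can ever disagree.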
```java
modelMapper.typeMap(TaskDTO.class, Task.class)
    .addMappings(mapping -> mapping.skip(TaskDTO::getFiles, Task::setFiles));
    .addMappings(mapper -> {
        mapper.skip(Task::setLlmSubmission);
```
Why was it necessary to skip all of these? Normally, CourseConfigImporter can rely on the ModelMapper to take care of most fields, but you manually implemented the mapping in CourseConfigImporter instead. Is that necessary?
This is because Task contains each field separately, while TaskDTO holds the LLM config in one single DTO object.
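This nested-vs-flat mismatch is the kind of mapping a by-name mapper cannot infer on its own, which is why the fields are skipped and copied manually in the importer. A minimal illustration with hypothetical field names (the real TaskDTO and Task classes have more fields):

```java
public class FlatteningSketch {
    // Hypothetical shapes: the DTO nests the LLM settings in one object ...
    static class LlmConfigDTO { String model; String prompt; }
    static class TaskDTO { String slug; LlmConfigDTO llm; }

    // ... while the entity keeps each setting as a separate flat field.
    static class Task { String slug; String llmModel; String llmPrompt; }

    // The manual flattening that an automatic field-by-field mapper would miss:
    static Task toEntity(TaskDTO dto) {
        Task task = new Task();
        task.slug = dto.slug; // same-named fields map trivially
        if (dto.llm != null) {
            task.llmModel = dto.llm.model;   // nested dto.llm.model -> flat task.llmModel
            task.llmPrompt = dto.llm.prompt; // nested dto.llm.prompt -> flat task.llmPrompt
        }
        return task;
    }

    public static void main(String[] args) {
        TaskDTO dto = new TaskDTO();
        dto.slug = "task-1";
        dto.llm = new LlmConfigDTO();
        dto.llm.model = "gpt-4o";
        dto.llm.prompt = "Grade this answer against the rubric.";
        Task task = toEntity(dto);
        System.out.println(task.llmModel + " / " + task.llmPrompt);
    }
}
```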
This PR is part of the Graded-By-AI Master's thesis and allows scoring through LLMs.
The relevant changes are: